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APPARATUS AND METHOD FOR STORING 
DATA WITH SELECTABLE DATA 
PROTECTION USING MIRRORING AND 
SELECTABLE PARITY INHIBITION 

5 

CROSS REFERENCES TO RELATED 
APPLICATIONS 

The present patent application is related to: 

U.S. Pat. No. 5257367 to Goodiander et al. for a Data 10 
Storage System With Asynchronous Host Operating System 
Communication link, filed on Jul. 17, 1990 as U.S. patent 
application Ser. No. 544,127 and issued on Oct 26, 1993 
and assigned to the assignee of the present patent applica- 
tion; and to 15 

FIELD OF THE INVENTION 

The present invention relates to an improved system for 
storing data in a mass storage system associated with a data 
processing system and, in particular, a method and system 20 
for storing data with selectable levels of protection. 

BACKGROUND OF THE INVENTION 

The increased power and speed of contemporary com- ^ 
puter systems has resulted in a corresponding demand for 
significantly increased mass storage capability because of 
die increased volumes of data dealt with by the systems and 
the increased size of the operating system and applications 
programs executed by such systems* There are a variety of 30 
mass storage devices available, such as disk drives, optical 
drives and tape drives, and one approach of the prior art to 
the mass storage problem is the use of mass storage systems 
associated with the computer systems. An example of a mass 
storage system is described in U.S. Pat No. 5,257367, cited 35 
above, and may be taken as illustrative of this general 
approach to the mass storage problem Typically, such a 
mass storage system will include a plurality of relatively fast 
mass storage devices, such as disk drives, together with one 
or more slower but larger archival type storage devices, such ^ 
as a tape drive or optical drive, and a processor based storage 
system management unit sometimes referred to as a "disk 
platform", to manage and control the transfer and storage of 
data between the mass storage system and the associated 
host computer system. In many such systems, such as that 45 
described in U.S. Pat No. 5,257,367, the storage system 
management unit is connected to the host computer system 
in the same manner as a conventional disk drive, and 
through a conventional disk drive controller normally 
present in the host computer system, to appear to the host ^ 
computer system as a conventional disk drive, although with 
significantly increased storage space. 

A recurring problem with data storage, however, and 
increasingly so as the capacity of mass storage systems 
increases, is that of protecting the data from loss, for 55 
example, due to failure of a disk drive in a mass storage 
system, and a number of techniques have evolved in the 
prior art for protecting data in mass storage systems. One 
technique, for example, is the use of parity information, 
which is essentially an additional body of data that is $o 
generated from the stored data, usually as the data is being 
stored, and that contains information that may be used to 
reconstruct or correct lost or damaged data. 

Another technique is "striping", which is customarily 
employed as a technique for increasing the speed with data 65 
may be written to or read from the disk drives of a mass 
storage system by distributing a body of data across a 
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plurality of disk drives so that reads and writes from and to 
the disk drives with respect to segments of the data can be 
overlapped. Striping also serves to protect the data, in which 
role it is usually used together with parity, by storing of a 
body of data across a plurality of disk drives so that any data 
that is destroyed or damaged on one of the disk drives may 
be reconstructed from the remaining data and die parity 
information. An extreme embodiment of this technique is 
found, for example, in the Thinking machines Corporation 
CM-2 system which operated with 39 bit words, each 
containing 32 data bits and 7 parity bits, and stored the bits 
of each word in parallel across 39 disk drives, with one bit 
of each word being stored on each drive. 

A third commonly used technique of the prior art, which 
is again often used with parity data, is * t mirroring w wherein 
the entire body of data and parity information stored in the 
mass storage system is duplicated on a second set of disk 
drives, which may reside in a second mass storage system. 

These techniques for data protection of the prior art 
however, have not been entirely satisfactory for a number of 
reasons. For example, the combination of striping with 
parity as implemented in most mass storage systems will 
generally provide adequate protection in the event a failure 
of a single disk drive in a mass storage system, but will 
generally not allow data reconstruction or recovery if two or 
more disks fail. Also, while it is possible to construct mass 
storage systems using striping and parity that will allow 
recovery from multiple disk drive failures, this approach 
requires either increasing the granularity of data storage, that 
is, providing more disk drives so that smaller blocks of data 
are stored on each drive, or significantly increasing the 
amount of parity information, again requiring more disk 
storage space, or both. As a consequence, the cost of the 
mass storage unit soon becomes impractical and the perfor- 
mance of the mass storage system, that is, the rate at which 
data may be written and read, may suffer. 

Mirroring the entire body of data parity information 
stored in a mass storage system, in turn, immediately 
doubles the number of disk drives required to store the data 
and parity information and, again, the cost of the mass 
storage unit soon becomes impractical and the performance 
of the mass storage system, that is, the rate at which data 
may be written and read, may suffer. 

Yet another problem with mass storage systems of the 
prior art is that the generation of parity information and the 
writing of the parity information into fee disk drives sig- 
nificantly increases the time required to store incoming data 
in the mass storage system. The additional time required to 
generate and store parity information may, in turn, be 
unacceptable in certain circumstances and may actually 
result in a failure to acquire and store the data, for example, 
when large volumes of data are being received at short 
intervals or when the data is available only for limited 
periods, as when data is being downloaded from satellite a 
link. 

The present invention provides a solution to these and 
other problems of the prior art 

SUMMARY OF THE INVENTION 

The present invention is directed to a mechanism and 
method for use in a mass storage system for providing user 
selectable levels of protection against data loss or damage. 

According to the present invention, the mass storage 
system includes a plurality of mass storage devices for 
storing data and parity information wherein the system 
includes a host processor including memory and disk man- 
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agement facilities and a disk platform connected from the disk drive the parity blocks corresponding to each of the data 

host processor and controlling a plurality of disk drive units blocks written into the disk drives without corresponding 

comprising the mass storage devices, a protection mecha- parity blocks. 

nism providing user selectable levels of protection against According to the present invention, the selectable mirror- 

data loss - 5 ing and selectable generation and storage of parity informa- 

The protection mechanism includes the plurality of disk tion may be implemented independently of one another, or 

drives for storing data in data blocks and parity information together in a single system, and either may be controlled 

in parity blocks in storage segments of the disk drives. In this directly by user input 

regard, both the data and the parity information may be The data protection mechanisms of the present invention 
distributed across the disk drives according to any of a 10 thereby allow a user to selectively tailor the level of pro- 
number of methods, and some or all of the disk drives may tection in a mass storage system to meet the changing 
be assigned to store bom data and parity information, or a requirements of operation and use of the system by provid- 
first plurality of disk drives may be allocated as data disk mg selectable levels of protection. For example, at the 
drives for storing data in data blocks in storage segments of lowest level the system may be operated without mirroring 
the data disk drives and at least one disk drive may be 15 and with parity generation and storing inhibited, thereby 
allocated as a parity disk drive for storing parity information providing the maximum rate of data reception and storage 
in parity blocks. 5 Ut me lowest level of protection. At the next level parity 

According to the present invention, the storage segments information may be generated and stored while the data is 

of the disk drives are organized into at least two functionally being received and stored, thus providing a higher level of 

separate logical units for storing data wherein each logical 20 protection but a lower data transfer rate. At the next level, 

unit includes storage segments on each one of the disk the user may select mtra-rnirroring, wherein data that is 

drives, and the parity information in each parity blockrelates selected and identified by the user to be additionally pro- 

to the data stored in a plurality of corresponding data blocks tected is written to logical units of the disk drives that have 

wherein each one of the corresponding data blocks are been selected and designated as mirrored logical units, 

located in a different one of the disk drives. 25 thereby providing a still higher level of protection for the 

A disk allocation mechanism stores an identification of at most critical or important data at the cost of some increase 

least one of the logical units to be mirrored, and a memory in storage space. Finally, the using the same mechanism as 

management mechanism controls operations of the disk provided for intra-mi^^ 

platform for writing data blocks and parity blocks into me for mirroring, the user may achieve full mirroring for all data 

disk drives. According to the present invention, the memory 30 stored in the system, but at the cost of significantly increased 

management mechanism is responsive to the identification storage space, 
of a logical unit for mirroring all data blocks written into the 

designated logical unit by writing a first copy of a data block DESCRIPTION OF THE DRAWINGS 

assigned to a first storage address in a designated logical unit 35 The foregoing and other objects, features and advantages 

into the assigned storage address in the designated logical 0 f (h e present invention wili be apparent from the following 

unit and writing a second copy of the data block assigned to description of the invention and embodiments thereof, as 

a storage address in a designated logical unit into a second illustrated in the accompanying figures, wherein: 

storage address in the diskdrives wherein the second storage FKJS. 1(a) to (/) show the historical development of the 
rt^!— rf!** ^ ^^wed with respect to the first storage address so ^ perforce 0 f computer systems; 

that the second storage address is located in a disk drive r . 

separatefrommedatadisk^ ™- P™«* a simplified block diagram of a 

address, and writing at least one parity block containing pnor art approach to a disk-based memory system; 

parity Mormation relating to the data blockinto a disk drive. FIGS. 1(h) and l(t) illustrate an improved system in 

In further aspects of the invention, the data blocks are 45 whidl me invention could be implemented; 

organized in the disk drives in a striped configuration so mat FIG, 2 is a simplified block diagram of a platform-based 

each sequentially addressed data block is located in a disk memory system that could implement the present 

different sequential one of the disk drives. invention; 

Another and independent aspect of the present invention FIG. 3 is a simplified block diagram of the disk memory 
provides user selectable levels of protection by allowing the 50 platform of the system of FIG. 2; 

writing data blocks to the disk drives without corresponding FIG. 4 is a simplified block diagram of the host interface 

parity information being stored in the disk drives. In this block of the disk memory platform of FIG. 3; 

aspect of the invention, the memory management mecha- HG. 5 is a simplified block diagram of the one of thedisk 

nism is responsive to assertion of a parity inhibit command con{ro Uer channels of the disk memory platform of FIG. 3; 
for f Writingonly data blocfe into Ae disk drives. 55 FIG. 6 is a simplified drawing employed for an example 

In a further aspect of allowing the user to select whether t0 show ^ ^ aS9Bamms operation of the system of 

parity information will be generated and stored when the 2* 

data is written into me disk drives, the memory management „ .„ . . 

mechanism is responsive to subsequent de-assertion of the HGS. 7A and 7B are diagrammatic representaUons of the 
parity inhibit command for detenniiimg which data blocks 60 operation of the present invention; and 

have been written into the disk drives without corresponding FIG. 8 is a block diagram of a portion of a mass storage 

parity blocks written into a disk drive, reading the data system implementing the present invention, 

blocks written into the disk drives without corresponding DESCRIPTION OF THE INVENTION 

parity blocks and all associated data blocks having parity DK>UUmON Or THE INVENTION 
information in a parity block corresponding to each of the «5 The following will first describe two implementations of 

data blocks written into the disk drives without correspond- mass storage systems and host computer systems in which 

ing parity blocks, and generating and writing into a parity the present invention may be implemented and used, the 
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second being a significant improvement over the first and cations program; that is, the applications program has no 
being the subject of U.S. Pat. No. 5,257367, and will then idea where the data is physically located. At system setup 

describe the present invention. time (or possibly subsequently through operator input), the 
A. General Description of Host Computer Systems With locations of the data is stored in tables (not shown) which are 

Associated Mass Storage Systems (FIGS. 1(a) through 1(h) 5 part of or accessible by the operating system 12. Knowing 

and 2 through 6) from the tables mat the requested data is on a particular disk 

A given system's overall performance capability is the drive 16 at a particular track between starting and ending 

result of the integration of a number of independent tech- sectors, the operating system 12 outputs a disk read request 

nologies whose individual growth in performance/capability on line 18 to the disk controller 14. The disk controller 14, 

over time differs markedly from each other. FIGS. l(a)to(g) 10 in turn, then issues a read request to the appropriate disk 

shows the historical performance of computer systems' drive 16 on its connecting line 20 which causes the read head 

underlying technologies. FIG. la shows the exponential (not shown) within the disk drive 16 to move to the 

growth in semiconductor component performance. The fac- designated track and then read data and output it to the disk 

tors behind this are well known and include advances in controller 14 on the line 20 from the starting sector to the 

process technology that increase circuit density and speed is ending sector. When the data has been received by the disk 

Shrinking geometry's and increased wafer yields combined controller 14 (into an appropriate cache/buffer memory, the 

with circuit design innovations mean semiconductor perfor- operating system 12 is informed by an appropriate signal on 

mance should continue its exponential growth. FIG. lb line 18. 

reflects the exponential growth in CPU hardware perfor- As can be appreciated, if one wants the operating system 
mance as measured in MIPS. CPU's are the direct benefi- 20 12 to do more, the programming of the operating system 12 
ciaries of semiconductors as well as circuit design improve- must get more complex. Given the present state of com- 
ments and architectural innovations such as massively plexity of the typical operating system and the capabilities of 
parallel CPU's. FIG. 1c shows the trend in performance for the average systems' computer programmer with respect to 
operating systems. OS performance is being flattened by such esoteric matters as re-entrant coding and "run any- 
several factors such as the additions of user interfaces, 25 where" coding, to ask the operating system to do more is to 
graphics support, and the sheer growth in size over the years, askfor trouble because of information handling bottle necks, 
which has made OS's one of the most voracious consumers There is also the problem of system overhead. If you ask 
of computer resources. FIG. Id shows the capacity/ the operating system to do more, it will add to the overhead 
performance improvements of disk drives. This curve could of the operating system and, therefore, to the overhead of 
be best described as "leap-linear." Disk drive device perfor- 30 every program which accesses it 
mance and capacities tend to grow linearly until a new For any given OS and computer system, implementation 
technological event occurs. Such events in the past have of any real time function will cause the OS to consume a 
been the introduction of sealed disk Winchester technology large portion of the computing resource, rapidly degrade the 
in the early 70s, the introduction of thin film heads and performance of the system from the user's perspective, and 
plated media in the 80s, and, in the 90s, the general 35 severely limit the work product computing potential, 
introduction of 5400 and 7200 RPM drives to cut latency As those skilled in the art will also readily recognize and 
delays and contact recording technology that may push track appreciate, even if the penalty of added overhead is made to 
densities to 100,000 per inch. FIG. le demonstrates the the operating system so as to achieve the convenience in 
performance of I/O systems. This curve represents the other areas, such an approach includes no means of ever 
composite effect of CPU and controller hardware, operating 40 reducing the added overhead. 

systems, and disk drives. It should be noted that the expo- File Maintenance, Management, and Archival Copy 
nential growth in semiconductor performance has not been (FMMAC) are tasks essential to the reliability, usability, and 
reflected in I/O system performance, which has seen near integrity of computer stored data. These tasks are now 
linear growth. FIG. If reflects a commonly understood performed by Operating Systems functions, separately run 
phenomena with applications programs that, over time, 45 applications programs, operator or system manager manual 
additions, changes, and maintenance to the program tend to intervention, or a combination of these techniques, 
lead to a decrease in its performance. FIG. lg demonstrates These FMMAC tasks almost always require a manual 
that, over all systems performance has showed continued operator decision to initiate and complete. Often they 
improvement, but at a much slower rate than its underlying require the computer system to be taken off-line and there- 
technologies. In fact, without hardware upgrades and 50 fore not available to users during the time it takes to 
improvements over all system performance declines in complete these tasks. Some larger, sophisticated Operating 
response to the performance of operating systems and appli- Systems allow a designated File to be taken off-line leaving 
cation programs. most of the computer resource available. However, manual 
FIG. 1(h) shows a typical prior art computer system intervention is still required to initiate file maintenance and 
employing disk drives for storage. The host computer 10 55 archival copy. 

(i.e. the one interfacing with the computer operators) Because these crucial FMMAC tasks rely on manual 

includes an operating system 12. As known to those skilled intervention, arbitrary circumstances and schedules, the pre- 

in the art, the operating system is a set of computer programs <h'ctabilily of these tasks being performed is low. This is 

that run continuously while the computer has its power on. especially true outside of centralized "Mainframe" Com- 

The operating system controls all the functions of the 60 puter centers (FMMAC tasks are typically performed here 

computer including requests for operating portions of the by a dedicated maintenance shift at great additional operat- 

memory, error response, and input/output (3/0) requests. The ing expense). However, most computers (and by extension, 

computer 10 has a disk controller 14 connected thereto and most computer stored data) are not located inside -computer 

the disk controller 14, in turn, is connected to four disk centers" or have the benefit of dedicated file maintenance 

drives 16. In use, an applications program (not shown) 65 staffs. Therefore the reliability, usability, and integrity of 

makes a request for data from the operating system 12. The most computer stored data now rests on human nature and 

location of the data is completely transparent to the appli- motivation and the dubious assumption that the circum- 



5,893,919 

7 8 

stances surrounding the computer system itself are immune given time and/or day each month (as in preparing a 
from intervening events such as device failures or rush jobs payroll), the logic of the computer 40 will learn that from 
mat take priority over FMMAC tasks. experience and begin loading the file from the near line 

Continuous duty computer systems such as Real Time storage apparatus 36 in anticipation of the expected request 
monitoring and control systems or On-line Transaction 5 so that when the request is made, the file is already read in 
Processing systems present additional barriers to FMMAC and available for use. Thus, the overall system performance 
tasks. In theory, no time is available to perform the tasks. In of the system 22 is not only improved over the prior art for 
reality, such systems or files are simply shut off for a given level of file location transparency to the users* but 
FMMAC. In "Critical Mission" applications (for example additionally, the overhead drops as the system learns the 
Nuclear Power Plant Monitoring and Control) me FMMAC 10 repeated patterns of use associated with its users. In 
problem is often abated by duplicating hardware systems at addition, whereas the prior art approach of FIG. 1(/) could 
great economic cost only do system and equipment diagnostics by taking the 

In the above-referenced patent the system of which is computer 10 off-line or by increasing the complexity and 
described next herein below as illustrative of the systems in overhead of the operating system 12 once again by having 
which the present invention may be embodied, a high-speed, is the operating system 12 perform such functions in a back- 
Mgh-capacity, fault-tolerant error-correcting storage system ground mode, the storage system and equipment diagnostics 
was disclosed which provides a solution for many of the are now performed by the computer 40 located within the 
above-described needs of modern computer systems (both disk platform 28 on a continuing and time-available basis 
military and commercial). As will be seen from the description which foUows, when 

An improved overall computer system employing both 20 the disk drives 16 have a fault or error, any errors and 
disk and near fine archival storage and typical of the systems problems found can be corrected or at least pin pointed for 
in which the present invention may be employed is shown in operator correction and replacement without taking the 
FIG. 2 where it is generally indicated as 22. The system 22 system 22 off line or causing any significant degrading of the 
has a host computer 10 containing an operating system 12 performance thereof. 

with its tables 24. There is also a console privilege interface 25 Having thus described a preferred overall system 
26 by means of which outside user consoles (not shown) can approach to storage systems, attention will now be turned to 
be used to access the host computer operating system 12. a preferred construction of the disk platform 28. That system 
There is once again a disk controller 24 since there is no employed individual disk channel controllers for respective 
change to the operating system 12 and the operating system ones of the disk drives 16 and each of the disk channel 
12 is set up to interface with the disk controller 24. Rather 30 controllers included a cache/buffer memory. All the disk 
than being connected directly to the disk drives 16 as in the channel controllers were controlled by control signals from 
prior art approach of FIG. 1(/), however, the single line 20 a common control computer and both data and the control 
fromthediskcontroller24isconnectedtoanintelligentdisk signals employed the same bus. As depicted in FIG 3 the 
platform 28. The disk platform 28 is then connected to preferred disk platform 28 of mis disclosure has been 
interface with the disk drives 16 through lines 30. 35 restructured to greatly improve the performance whue, at me 
Additionally, the disk platform 28 has a bi-directional con- same time, also substantially reducing the complexity 
nection 32 through a communications link 34 to the console whereby the reliability is increased and the implementation 
privilege interface 26. In the preferred embodiment as thereof is greatly simplified with attendant known benefits 
applicable for large-scale storage systems, there is also near In FIG. 3, line 18 is labeled as the "SERIAL INTER- 
line archival storage apparatus 36 connected to the disk 40 FACE" and line 20 is labeled as the "HOST INTERFACE", 
platform 28 through line 38. To perform within the system In a tested embodiment, the SERIAL INTERFACE of line 
22 of this invention, the near line storage apparatus 36 18 is an RS-232 interface and the HOST INTERFACE of 
should be of an automated variety selected from any of a line 20 is a SCSI (small computer system interface) inter- 
number well known in the art where off-line storage media face. This choice was as a result of the availability of 
are loaded for reading and writing on request by automated 45 commonly-used equipment for use in testing only and those 
mechanisms so as to avoid the necessity of operator inter- skilled in the art will readily recognize and appreciate that 
vention. me same techniques being described here by way of 

To accomplish its unique improvements over the prior art, example could be accomplished employing other hardware 
the disk platform 28 includes its own computing capability interface methods and apparatus known in the art, or yet to 
as represented by the computer block 40. As will be seen 50 be developed. In this regard, the improvements of mis 
shortly, the computer 40 may, in fact, comprise multiple invention are both universal and adaptable. The two lines are 
processing units; but, for the present it is sufficient to note connected to a interface and driver unit 42 which provides 
that the disk platform 28 is not the "dumb" controller 14 of the host interface, serial interface, and LCD display driver 
the prior art Employing the bidirectional connection 32 functions as indicated therein. The logic and apparatus of 
through the communications link 34 to the console privilege 55 interface and driver unit 42 is also connected to a display 44 
interface 26, the computer 40 can find the location of data The display 44 is not an absolute requirement- but, is 
from the tables 24 of the operating system 12. The location preferred so that messages can be provided to an operator as, 
of data within the disk drives 16 or the near line archival for example, in the event of a detected malfunctioning disk 
storage apparatus 36 is, therefore, transparent to both the drive 16 which has been removed from operation and should 
operating system 12 and the applications programs request- 60 be replaced in order to restore fill system capability 
ing it If requested data is located on the near line archival There are a plurality of disk controller channels 46 
storage apparatus 36, it can be retrieved automatically and connected to respective ones of a plurality of disk drives 16 
then be relayed to the operating system 12 just as if it was It is preferred for error detection and correction optimization 
on one of the disk drives 16. More importantly, the preferred that at least one extra disk controller channel 46 and asso- 
computer 40 is of the self learning variety which learns 65 dated disk drive 16 be up and rawing at all times to be used 
ttawgh experience. Thus, for example, if a particular file as a "hot" replacement in the event of a detected error or 
from the near line archival storage apparatus 36 is used at a potential problem. Thus, it is preferable to dynamically 
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substitute a disk channel and let the problem channel be disk controller channels 46 so that the multiple, low-level 

fixed by service personnel off-line. When repaired, mat command approach, which occupied valuable time on the 

channel can then become the hot replacement Each of the CPU bus 66, is eliminated The micro-processors 50 each 

disk controller channels 46 again has a cache/buffer memory contain firmware that not only performs the actual low-level 

48 associated therewith. Where the memory of the disk 5 command steps required to do disk transfers; but, in 

controller channels performed the dual function of a addition, performs continuing self-testing of the individual 

memory buffer into which to read and from which to write controller channel on a time-available basis to assure that the 

with respect to the disk drive connected thereto and a cache components are continuing to work properly. Should such 

memory, the memories 48 of this invention in general self-check indicate a potential problem, the CPU 60 then 

perform the buffer function. It is preferred that the memories 10 conducts far ^ lCT testin g mc subject disk controller chan- 

48 be sized to be involved in more man one transfer at a time nel 46 *° tf ^ " hot " s P are disk 16 or ^sk 

so that at least one stream of data can be read into the controller channel 46 should be employed while the mal- 

memory 48 while another stream of data is being written out ^^7,^ t™£v ° n ^ ***** 44 ^ 

of the memory 48. In many systems of the prior art, the disk ^J^Ti^rl^^' * • 

$ 1 u mc j u< * e ™V < ^^f m ^ power. Thus, me ^ me ^ 56 ^ blem of deali with ^ 

tiie disk controller channels had to be fed a stream of low arbitration on the XBUS 64 is also eliminated. That, of 

level commands , resulting in a large amount of non^data course, has many beneficial effects such as increased 

traffic sharing the bus with the data. That, of course, reduced reliability, increased throughput, and simplicity of irnple- 

the overall mroughput of data along the bus. In the "intel- 20 mentation. 

Kgent** disk controller channels 46, the disk controller chan- To get a better idea of the operation of the disk platform 

nels 46 are provided only with high level command signals 28 of this invention, we will now turn to FIGS . 4 and 5 with 

and accomplish the balance of the indicated read and write particularity. FIG. 4 shows further details of the interface 

functions internally with the micro-processors 50. Thus, the and driver unit generally labeled as 42 while FIG. 5 shows 

amount of non-data traffic on the CPU bus 52 is greatly 25 further details of one of the disk controller channels 46. With 

reduced, allowing more available time for the transfer of reference first to FIG. 4, there is an XBUS driver 66 

data— which, of course, should be the primary function connected to the XBUS 64 with a connector 68. There is also 

thereof for optimum system performance. a nost interface driver 70 (SCSI in the tested embodiment) 

A significant element of this application is the cache connected ^to line 20 back to the host computer 10 by a 

memory & control unit 54. While the memories 48 of the 30 connector 72 « As with the other elements, there is also a local 

disk controller channels 46 in the tested emtodiment may ^°-P roc f s ^ ™ to control the operation of the elements 

range from 8K bytes in size to I28K bytes in size, the cache ? f f 5f uaa£ ^ *?* ^- mut ^ The micro-processor 74 

memory 56 of the cache memory & control u^t It rSy *gS^^ 

ran<m L i<vf u^ c im trt rp, . * display 44) and a serial interface driver 78 (which is con- 

byteS f £ 256M ' ^ « nected to the serial interface on line 18). All the driver 

exemplary, however, and other sizes of memories 48 and 35 elements are well known to those skilled in the art and wS 

cache memories 56 may be used without changing the be chosen according to the type of device with which they 

structure or operation of the present invention. The cache must interface. The microprocessor 74 is also connected to 

memory & control unit 54 also includes its own micropro- the CPU bus 52 with connector 80. The heart of the interface 

cesser 58 and master error recovery logic 59. and driver unit 42 and most important part thereof is a pair 

Finally, the preferred disk platform 28 includes a central 40 of unidirectional FIFOs 82 and 84. Under the control of the 

processing unit (CPU) 60 with its own private memory 62. local micro-processor 74, FIFO 82 receives and transmits 

The CPU 60 is employed for cache management and system data from the XBUS 66 to the host computer 10. Similarly, 

control functions in a manner to be described shortly. As FIFO 84 receives and transmits requests and data from the 

shown in FIG. 3, the above-described components with the host computer 10 to the XBUS 66. Thus, bi-directional 

exception of the CPU 60 are interconnected to communicate 45 transmissions can take place between the XBUS 66 and the 

with one another by the XBUS 64 and interconnected to host computer 10. This is another feature of this embodiment 

communicate with the CPU 60 by means of the CPU bus 52. which improves the overall throughput of the system 22. 

As mentioned above, not only is the transfer of data The disk controller channels 46 depicted in FIG. 5 also 

between the memories 48 and the disk drives 16 done includes an XBUS driver 66 and a disk drive interface driver 

asynchronously as in the prior disclosure; but, additionally, 50 92 connected to the associated disk drive 16 with their 

transfers into and out of the cache memory 56 is done associated connectors 68, 94. Similarly, the local micro- 

asynchronously. Because of its size (Le. 16M bytes) the processor 50 is connected to the CPU bus 52 with a 

cache memory 56 can be employed for the transfer of connector 80. In addition to the data buffer memory 48, there 

multiple requests from the host computer 10 at one time. is a buffer address register 86 which controls the locations in 

Because of access times of the disk drives 16, lengths of 55 the memory 48 which are accessed and a data traffic sema- 

transf ers requested, and/or data portions already in the cache phore 88 which operates in a manner readily apparent to 

memory 56, the requests to read and write data from the those skilled in the art to control access to and from the 

CPU 60 to the disk platform 28 may not be responded to in memory 48 under the control of the micro-processor 58. 

the order received. As soon as a request is completed, no Thus, it is the data traffic semaphore 88 which actually 

matter its position in the request sequence, the CPU 60 is 60 inserts the addresses into the buffer address register 86. The 

informed that it is completed. Thus, throughput as seen by data traffic semaphore 88 must be present to prevent simul- 

th ^ S ^ mpUt 5 r 10 is gready improved. taneous access to the memory 48 by both the XBUS 64 (and 

The CPU 60 includes an embedded array disk operating elements connected therethrough) and the host computer 10. 

system 61 and employs its private memory 62 to keep track Such devices are well known and employed in the comput- 

of the contents of the cache memory 56 so that it can respond 65 ing art for that purpose as any attempt to simultaneously read 

optimally to requests from the host computer 10. The CPU from and write into a memory location can cause irreversible 

60 in mis system only issues high level commands to the errors. 
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The near line archival storage channel 100 is controlled in time-available basis in a background mode by the micro- 

the same manner as disk controller channel 46 through processor 58. The impact of this on the overall performance 

microprocessor 50 and cache/buffer memory 48 and con- of the system 22 should not be overlooked or minimized, 

tains the logic to control by way of control bus 101 the near Consider, if there are eight disk drives 16 containing the 

line archival storage 103 and its individual elements 104. 5 individual data bits of data and a ninth disk drive 16 

105 and 106 to read and write data by way of data bus 102. containing the parity bits associated therewith, if the parity 

Data read from near line archival storage 103 is held in bits are contained in an unused portion of the cache memory 

cache memory 56 or on disk drives 16 and is accessed by the 56 and only written to the ninth disk drive 16 when the eight 

host computer with sector numbers beyond the physical disk drives 16 are not otherwise involved in an actual data 

limits of disk drives 16 creating a virtually boundless storage 10 transfer, disk access time to access data is reduced by 

capacity. one-ninth or eleven percent. The savings in time could be 

Having thus described the construction and operation of even more substantial in an instance where one of the eight 

the system 22 in general, a more specific example of its disk drives 16 containing the data bits is rnalfunctioning and 

unique mode of operation will now be described with the parity bits are required to correct any errors in the data 

reference to FIG. 6. For simplicity, FIG. 6 depicts in 15 stream. 

simplified form only the cache/buffer memories 48 in the B. Description of a Preferred Embodiment of the Invention 

channels and the cache memory 56 as connected by the (FIGS. 7A, 7B and 8) 

XBUS 64. Assume that a request has been made by the host Referring to FIGS. 7A and 7B and 8, the present invention 

computer 10 to read data. The disk platform 28, of course, is diagrammatically illustrated therein using the system of 

knows (or can determine) the location of the data in the disk 20 FIG. 2 discussed above as an exemplary implementation of 

drives 16 through its above-described access to the tables 24 the present invention. 

in the host computer 10. According to fault tolerant As will be described in detail below, the present invention 
techniques, the data (and its parity bits) are spread across the is directed to data protection mechanisms of a mass storage 
disk drives 16. From the contents of its private memory 62, system and the methods of operation thereof to provide 
the logic in the CPU 60 knows the present contents of the 25 selectable mirroring of critical data and selectable genera- 
cache memory 56. Anything that is already in the cache tion of parity information. The operation of the protection 
memory 56 will not be reread, of course, which is the usual mechanisms of the mass storage system may thereby be 
function of any cache memory (Le. to eliminate redundant tailored to the specific operational needs of the user by 
and unnecessary disk accesses). The CPU 60 then issues providing selectable levels of data protection, 
hi^i level requests to the various disk controller channels 46 30 In one aspect of the present invention, the mirroring 
to have them retrieve the elements of the data from their mechanisms of a mass storage system operate to mirror 
locations on the disk drives. The requests also go to the selected data rather than all data stored therein thereby 
cache memory & control unit 54 so that it knows what is providing optimum use of the disk storage space and pro- 
going on. From there on, the collection of the data and its viding additional security for the selected data without an 
transfer to the host computer 10 is under the control of the 35 unacceptable increase in data storage requirements, 
micro-processor 58 in the cache memory & control unit 54. In another aspect of the present invention, the parity 
The micro-processor 58 assigns available buffer space (as mechanisms of a mass storage system allow parity informa- 
indicated by the dashed box 90) in which to receive the data tion to be generated and stored "off-line*', that is, after the 
of the request The data segments are asynchronously data has been received rather than while the data is being 
brought into the buffer memories 48 under the control of the 40 received, thereby significantly reducing the time required to 
rmcro-processors 50 as a function of the originally-issued receive and store data in circumstances wherein the time 
high level commands from the CPU 60. As the data seg- available to receive and store data is a critical factor or 
ments are received, the micro-processor 58 is advised by the limitation in use of the mass storage system 
micro-processors 50 over the XBUS 64. The micro- 1, User Selectable Intra-Mirroring 
processor 58 then asynchronously transfers the data seg- 45 First considering the mirroring mechanisms of the mass 
ments into their appropriate location within the assigned storage system of the present invention, the operation of the 
buffer space 90. When the entire data of the request is in the mass storage system of the present invention without "mir- 
assigned buffer space 90, the micro-processor 58 transfers it roring" is represented in FIG. 7 A. As illustrated therein, and 
to the host computer 10 through the FIFO 82 described as discussed above, a plurality of Disk Drives 16 are 
above. A write operation, of course, operated in much the 50 connected from and controlled by Disk Platform 28 to store 
same manner, except that data flow is in the opposite data and parity information, the present exemplary system 
™? n " te^g shown with three Disk Drives 16, respectively desig- 
While only shown in simplified representation in FIG. 6, nated as Disk Drives I6D4, 16D& and 16Dc, for storing data 
it may be appreciated therefrom and from a consideration of and at least one Disk Drive 16P for storing parity informa- 
the elements and their method of operation as described 55 tion. It will be understood by those of ordinary skill in the 
above mat the single cache memory 56 of substantial size as relevant arts, however, that this number and configuration of 
employed in this embodiment effect a vast improvement in disk drives is for fflustrative purposes only and that the 
simplicity and efficiency of operation (i.e. speed). Because number, assignments and configurations of Disk Drives 16 
of its size, the cache memory 56 will actually seIf-or*imize may be varied according to the needs, capacity and uses of 
the data it retains in cache over time so as to minimize the 60 the system. For example, not only may a greater or lesser 
number of data transfers required with the disk drives. In this number of disk drives be used, but both the data and the 
regard, it is important to recognize that the parity bits parity information may be distributed across a plurality of 
associated with data are contained in separately transferable disk drives, and at least some or all of the disk drives may 
locations on the disk drives 16. Thu s, over the course of time be u sed to store both data and parity information, rather man 
the most used data and their parity bits will be virtually 65 allocating or dedicating certain disk drives for storing data 
permanent residents of the cache memory 56 and will only and others for storing parity information. In the following 
be written to the disk drives 16 for back-up purposes on a example, however, and only for purposes of ease and clarity 
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of illustration and discussion, the data is shown as stored on For example. Parity Block (FB) 98ABC contains parity 

disk drives allocated for data storage and the parity infor- information relating to Data Blocks (DBs) 96A. 96B and 

mation is shown as stored on at least one disk drive allocated 96C. 

for parity information storage. Finally, and as generally indicated in FIG. 7A, the storage 

The storage space on Disk Drives 16D is organized into 5 space available in Disk Drives 16Da through 16Dc is 

segments of a predetermined, user selected size, such as 512 organized into one more logical partitions, referred to as 

bytes, 1024 byes, 2048 byes, or 4096 bytes, and so on, which Logical Units (LUNs) 100, wherein the mass storage 

are generally but not necessarily of equal size, and data is system, as described herein above, operates such that each 

stored on Disk Drives 16D in blocks, illustratively referred Logical Unit (LUN) 100 operates as a logical storage device 

to as Data Blocks (DBs) 96A through 96Z. Each Data Block io regardless of the number or physical configuration of the 

(DB) 96 occupies a segment of the disk drive storage space actual physical devices, that is. Disk Drives 16, comprising 

and may contain a number of bytes of data up to the size of each Logical Unit (LUN) 100. In the present example, and 

a segment of storage space and a given body of data may be for purposes of illustration, the storage space in Disk Drives 

comprised of or occupy one or more Data Blocks (DBs) 96. 16D is organized into two or more Logical Units (LUNs) 

As indicated, the segments of storage space and thus the is 100, designated in the present example as a Logical Unit 

Data Blocks (DBs) 96 are "striped" across Disk Drives (LUN) 100a and a Logical Unit (LUN) lOOfc, wherein each 

16Da through 16Dc and, in general. Data Blocks (DBs) 96 of Logical Units (LUN) 100 are assigned storage space in 

are written sequentially into Disk Drives 16Da through each of Disk Drives 16Da through 16Dc and are separately 

16Dc, with each sequential Data Block (DB) 96 being addressable to operate as logical devices for reading, writing 

located on a different one of Disk Drives l€Da through 20 and storing data. In addition, and as shown in FIG. 7A and 

16Dc. Therefore, in the present example Data Block (DB) the following FIG. 7B, it is assumed for purposes of the 

96Ais illustrated as located on Disk Drive 16Da, Data Block following discussions that the addresses, or locations, of the 

(DB) 96B on Disk Drive l€Db, Data Block (DB) 96C on segments of storage space in Disk Drives 16 are organized 

Disk 16Dc, Data Block (DB) 96D on Disk Drive 16Da so that the logical addresses of Data Blocks 96A through 

again, and so on. 25 96K are located within Logical Unit (LUN) 100a and the 

It will therefore be apparent that, as illustrated in the logical addresses, or locations, of Data Blocks 96M through 

present example, the "length** of each "stripe** across Disk 96Z are located within Logical Unit (LUN) lOOfc. 

Drives 16D, that is, the number of Data Blocks (DBs) 96 in Referring now to FIG. 7B, therein is illustrated the 

each stripe, will be equal to the number of Disk Drives 16D. "mtra-rnirrcring** of the present invention wherein FIG. 7B 

In the present example there are three Disk Drives 16D used 30 follows the same conventions of representation as described 

to store data, so that each stripe across Disk Drives 16 is above with regard to FIG. 7A. As described briefly above, 

three Data Blocks (DBs) 96 long. the mtra-mirroring mechanism of the present invention 

It will also be noted that each of the Data Blocks (DBs) provides mirroring of selected portions of the data stored in 

96 in a stripe is illustrated in FIG. 7A, and in the following Disk Drives 16, usually those portions or types of data that 

FIG. 7B, as occupying the same location, or address, in their 35 are considered by the user to be most critical to the user and 

corresponding Disk Drives 16. For example, Data Blocks thereby for which a higher level of protection from damage 

(DBs) 96A, 96B and 96C arc located at the same locations or loss is desired. 

in Disk Drives 16Da, 16D& and 16Dc, respectively, and so In a presently preferred embodiment of the present 

on. It will be understood by those of ordinary skill in the invention, data is selected for protection by intra-mirroring 

relevant arts, however, that the relationships between the 40 on the basis of assigning selected Logical Units (LUNs) 100 

locations of the Data Blocks (DBs) 96 in each stripe as to be mirrored. That is, one or more Logical Units (LUNs) 

illustrated in FIGS. 7A and 7B is a logical relationship and 100 are selected for mirroring by means of a command 

that the actual physical locations of the Data Blocks (DBs) identified in FIG. 8 as Logical Unit Mirror Select (LU 

96 in a stripe on their corresponding Disk Drives 16 need not Mirror Sel) 102C and the Data Blocks (DBs) 96 containing 

be in a one to one physical relationship so long as, for 45 data that is to be protected by mirroring are assigned Disk 

purposes of the present invention, each sequential Data Drive 16D storage locations within the selected Logical 

Block (DB) 96 in a stripe is located on a different sequential Units (LUNs) 100. In the present example, by way of 

one of Disk Drives 16. It will also be understood by those illustration, Logical Unit (LUN) 100a is selected by the user 

of ordinary skill in the relevant arts that each sequential for protection by mtra-minoring, and all data that is to be 

segment of storage space in a stripe need not actually contain 50 protected by mirroring is assigned to, that is, stored in. Data 

data, that is, some of the Data Blocks (DBs) 96 illustrated in Blocks (DBs) 96 having storage addresses, or locations, 

FIGS. 7A and 7B may be "empty", as may occur, for within Logical Unit (LUN) 100a, that is, in Data Blocks 

example, as data is written into or erased from Disk Drives (DBs) 96A through 96K. It will be appreciated by those of 

16, and mat, as mentioned above, a given body of data may ordinary skill in the relevant arts, however, that other 

occupy one or more Data Blocks (DBs) 96 and may thereby 55 methods may be used to select the data to be protected by 

occupy a part of a stripe or one or more stripes. mtra-mirroring, such as by selected address ranges in one or 

As has been described, it is common for mass storage more Logical Units (LUNs) 100. 

systems to store parity information as well as data and the According to the present invention, each Data Block (DB) 

parity information associated with Data Blocks (DBs) 96 is, 96 that meets the criteria for mtra-rnirroring protection, that 

in turn, stored in Parity Blocks (PBs) 98 that are stored on 60 is, is to be stored in a Logical Unit (LUN) 100 designated for 

Disk Drive 16P wherein, in the present example, each Parity mtra-mirroring, such as Logical Unit (LUN) 100a in the 

Block (PB) 98 contains parity information for and corre- present example, is written into two locations in Disk Drives 

sponding to the data stored in the Data Blocks (DBs) 96 of 16D wherein the two locations are skewed with respect to 

a corresponding stripe of Data Blocks 96 and is often, but one another such that each location resides on a different one 

not necessarily, stored in a location in Disk Drive 16P 65 of Disk Drives 16D. In the present embcrfiment of intra- 

logically corresponding to the locations of the corresponding mirroring, for example, the first or original copy of a Data 

Data Blocks (DBs) 96 in Disk Drives 16Da through 16Dc. Block (DB) 96 will be written into a Disk Drive 16D» and 
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the second or duplicate copy will be written into the next As also indicated in FIG. 8, the allocation of Disk Drives 

sequential Disk Drive 16D#»+1. This process is illustrated in 16 as Disk Drives 16D and 16Pis under control of a system 

FIG. 7B wherein Date Blocks (DBs) 96A through 96C arc user operating through User Input 102. for example, through 

S£ t&'JWSiKE r^rt? * fiT a h ^« l or user ^ nsole ' aUocationTTntered bya 

into Disk Drive UDb and duplicated in Disk Drive 16Dc A- ^ , ^ * ? aU ° Catl ? n as a ^ sk Dnve 16D 

Data Block 96C is written into Disk Mve 16^ and n^f^T^^ s ^°^^^ Units 

duplicated in Disk Drive 16D* and so on. < LUN *> m or °? e 5 subd ™ s ">* s * *sk storage space. 

As further illustrated in FIG. 7B, Parity Blocks (PBs) 98 "P° n ^ implementation of ktra-nmroring, to be 
are generated and stored on Disk Drive 16Pfor the mirrored 10 111111016(1 Disk Allocation Table 104 is read at first system 

Data Blocks (DBs) 96 in the same manner as described initialization by SCDU 100 which, operating through Host 

above, with the parity information stored in each Parity Computer 10, Disk Controller 24 and Disk Platform 28, 

Block (PB) 98 being generated in the same manner regard- formats and configures each of Disks 16 according to its 

less of whether the Data Blocks (DBs) 96 corresponding to allocation. The disk allocation process may thereafter be 
a given Parity Block (PB) 98 are original copies of Data 15 repeated at a later time, for example, as system usage or data 

Block (DBs) 96, duplicate copies of Data Blocks (DBs) 96, storage needs change, whereupon the system will be 

or a mixture of both. re-initialized again. 

As a consequence, it will be apparent to those of ordinary It should be noted that the aliocation of Disk Drives 16 as 

skill in the relevant arts after a study of the illustrative Disk Drives 16D or Disk Drives 16P under user control 
example shown in FIG. 7B and the possible combinations of 20 allows Disks 16 to be re-allocated as system usage or needs 

original data, intra-rmrrored data and parity information change, or as the users gain experience with system 

surviving after any hypothesized single or double disk drive performance, or as disks are added to or removed from the 

failure, that the method and mechanism for intra-mirroring system, so as to obtain the optimum allocation of Disk 

of the present invention will allow the reconstruction of Drives 16. 

intra-mirrored data from the surviving original and duplicate 25 As also illustrated in FIG. 8, Host Computer 10 further 

data and parity information after any single or double disk includes a Demand Paging and Memory Management Facil- 

driye failure. It will also be apparent to those of ordinary ity (DPMM) 106, wherein, as is usual in such systems and 

skill in the relevant arts that the method and mechanism of as is well understood in the arts. Demand Paging and 

the present invention may be readily extended to different Memory Management Facility (DPMM) 106 operates to 

numbers and configurations of disk drives and logical units 30 relate and translate virtual or logical addresses generated by 

and different distributions of the data and parity information the application and operating systems programs into physi- 

in the disk drives, and that the method and mechanism may cal addresses of the corresponding data or program code in 

be expanded to allow recovery from failures of greater the system. As is well understood, the physical addresses 

numbers of disk drives by the addition of further DiskDrives may be of locations in system memory or on Disk Drives 16 

16D and additional parity information. 35 and, if on Disk Drives 16, will include an identification of 

As wiU be described below, mtra-mirroring according to the Logical Unit (LUN) 100 and a segment or segments 

the present invention is implemented in the OS 12* s Tables therein for storing one or more Data Blocks (DBs) 96. As is 

25 in Host Computer 10 and in related facilities in Host well known in the art the details of when and where in the 

Computer 10, including certain associated disk management system addresses are translated from the logical level to the 

and address translation mechanisms found in Host Computer 40 detailed physical level, and the levels of translation per- 

10. It will be recognized by those of ordinary skill in the formed at each step, depends upon the distribution of 

relevant arts that the general functions and structures of "intelligence" and functions among Disk Controller 25, Disk 

these tables and mechanisms are commonly wen known in Platform 28 and Disk Drives 16 in a particular system, 

the art and are found in a wide variety of implementations For these purposes, and as illustrated in FIG. 8, Demand 

in most computer systems and that the present invention 45 Paging and Memory Management Facility (DPMM) 106 

presents instead a new use and operation of such tables and maintains an Address Translation Table 108 storing the 

mechanisms. These tables and mechanisms will therefore addressing information necessary to translate logical to 

not be described in detail below, but will be discussed only physical addresses, and may include a set of Translation 

as sufficient for those of ordinary skill in the arts to under- Lookaside Buffers 110 that operate as a cache mechanism to 

stand and implement the present invention. 50 store previously translated addresses, thereby avoiding the 

As illustrated in FIG. 8, the Operating System (OS) 12 calculation operations necessary to translate a logical 

programs of Host Computer 10 typically include a System address to a physical address upon each reference. 

Disk Configuration Utility (SDCU) 100 which controls and Address Translation Table 108 is represented in FIG. 8 as 

manages the initial configuration and formatting of all Disks storing a Data Bern Entry (DIE) 112 for each data item 

16 connected from Host Computer 10. As is well known in 55 stored in Disk Drives 16 wherein each DIE 112 includes a 

the art, the disk drives connected from a host computer can Data Address Translation Information (DAT!) 114 and Parity 

be and are configured and formatted for selected purposes by Address Translation Information (PATI) 116 containing the 

a system facility such as SDCU 100 at system initialization, information necessary for the logical to physical address 

and may thereafter be reconfigured and reformatted as translation of each read or write reference to a data item 

necessary, again using SDCU 100. In the present exemplary 60 stored in Disk Drives 16, and the corresponding parity 

system, for example, and as described above, Disk Drives 16 information wherein each data item is a body of data stored 

are connected to Host Computer 10 through Disk Platform in one or more Data Blocks (DBs) 96 in Disk Drives 16. 

28 and Disk Controller 24 and SDCU 100 would, in mis As represented in FIG. 8, each Data Address Translation 

implementation, perform the initial configuration and for- Information (DATI) 114 includes an a Logical Unit Identi- 

^f^^ f 3 ^ ^ f Disk Drivcs 16 ti* 00 ^ ™* k Controller 65 fication (LUNID) 118 identifying the Logical Unit (LUN) 

24 and Disk Platform 28 in the manner usual in computer 100 that the data item resides in and, from Disk Allocation 

systcms * Table 104, a Mirror Designator (MD) 120 indicating 
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whether the Logical Unit (LUN) 100 is designated to be In the event of a read operation. Demand Paging and 

rotra-mirrored, a Disk Drive Identification (DDI) 122 iden- Memory Management Facility (DPMM) 106 will operate as 

tifying the particular Disk Drive 16D that at least the first described above with respect to generating Data Block 

Data Block (DB) 96 of the data item resides in, and, for Addresses (DBAs) 126 to non-mirrored Logical Units 

example Lengh Information (U) 124 to identify the num- 5 (LUNs) 100, but will receive rather than provide a Data 

ber of Data Blocks (DBs) 96 m file data item so that Block (DB) 96 with ^ reiated ^ information for each 

S SraS ( } ltem ^ ^ta Block Address (DBA) 126. to me event of a read from 

if will be noted that the identification of the Data Blocks ^^It^^u^ and 
(DBs) 96 comprising a data item can also be identified^ Me f°Z M ^! ment Facik * < D f™> 106 wiU S enerate a 
determined in a number of ways that will be known and 10 read in me manner described above Wlth res P<** *> 
understood by those of ordinary skill in the relevant arts. a wnte address 211(1 wil1 ^sue the address and a read 
Each Data Address Translation Information (DATI) 114 may command to Disk Controller 24, together with a correspond- 
also include, depending upon the particular implementation in S P 3 ^^ a^ess. E &e parity check reveals an unrecov- 
of the mass storage system, the track and sector of the Disk erable error in the data read from the first copy of the Data 
Drive 16D in which the first Data Block (DB) 96 resides. 15 Block ( DB ) Demand Paging and Memory Management 

As also indicated, the related Parity Address Translation Facility (DPMM) 106 will generate a second read address to 

Information (PAIT) 116 is similar in structure and content, the duplicate copy of the Data Block (DB) 96, with a 

but contains the information necessary to locate the parity corresponding parity address, and will thereby read the 

information relating to the referenced Data Block (DB) 96 or mirrored copy of the Data Block (DB) 96. 

Data Blocks (DBs) 96. 20 It is apparent, therefore, that the mtra-mirroring of the 

Upon any Host Computer 10 read or write reference present invention provides the degree of protection from 

pertaining to one or more Data Blocks (DBs) 96, therefore, data loss that is provided by conventional mirroring, but for 

Demand Paging and Memory Management Facility data that is identified and selected by the user as critical and 

(DPMM) 106 will access Address Translation Table 108 and for which additional protection is desirable, and without an 

will read or generate, as necessary, the corresponding Data 25 unacceptable increase in data storage requirements. 

Bern Entry (DIE) 112 to obtain the address information 2. User Selectable Parity Generation 

relating to the Data Blocks (DBs) 96 and will generate a As described briefly above, the mass storage system of the 

sequence of one or more write or read Data Block Addresses present invention further includes a parity mechanism which 

(DBAs) 126 to Disk Controller 24 to identify the locations allows parity information to be generated and stored "off- 

in Disk Drives 16 of one or more Data Blocks (DBs) 96 to 30 line", that is, after the data has been received rather than 

be read or written, together with the corresponding parity while the data is being received, thereby significantly reduc- 

information in Disk Drive 16P. Disk Controller 24 will, in ing the time required to receive and store data in circum- 

turn, direct and control Disk Platform 28 and Disk Drives 16 stances wherein the time available to receive and store data 

as described herein above to perform the requested opera- is a critical factor or limitation in use of the mass storage 

ti °B- 35 system. 

In the event of a write operation to a non-mirrored Logical As has been described above, the exemplary mass storage 

Unit (LUN) 100, each Data Block (DB) 96 comprising the system in which the present invention has been implemented 

data item will be accompanied by a single Data Block includes cache memories for storing data and parity infor- 

Address (DBA) 126 identifying the locations in Disk Drives mation written to and read from Disk Drives 16 and it has 

16 that the Data Block (DB) 96 and an address the related 40 been described that the system may generate parity infor- 

parity information is to be written into, together with a write mation while the corresponding data is written to Disk 

command and the parity information if parity is being Drives 16. It has been described that the parity mechanism, 

written. In the event of a write operation to a mirrored and the diskread/write control mechanisms of Disk Platform 

Logical Unit (LUN) 100, however. Demand Paging and 28 may hold the parity information in the cache memory 

Memory Management Facility (DPMM) 106 will note the 45 associated with the parity disk drive until such time as the 

identification of the Logical Unit)LUN) as a mirrored Logi- data disk drives do not occupy the system buses with data 

cat Unit (LUN) 100 and will generate and provide two Data reads and writes, whereupon the parity information may be 

Block Addresses (DBAs) 126 with each Data Block (DB) written from the cache memory to the parity disk drive. 

96. one ^identifying the location in Disk Drives 16of thefirst. While this method of caching data and parity information 

or original copy of the Data Block (DB) 96 and the second 50 to be written to and read from the disk drives may reduce the 

identifying the location in Disk Drives 16 of the mirror copy typical data write time by approximately 10%. There are 

of the Data Block (DB) 96. Demand Paging and Memory circumstances, however, when it is desirable to reduce the 

Management Facility (DPMM) 106 will also provide a write new data write time even further, as when the system 

command and parity information with a parity address, if receives large amounts of new data at short intervals or when 

parity information is being written. 55 the time to acquire new data is severely limited, as may 

As described above, the Data Block Address (DBA) 126 occur when data is downloaded from a satellite link, 
of the inirror copy of the Data Block (DB) 96 will be According to the present invention, therefore, and by 
generated from the Data Block Address (DBA) 126 of the selection through a user command entered through User 
first copy of the Data Block (DB) 96 by "skewing* 1 the Data Input 102, identified in FIG. 8 as Parity Generate De-Assert 
Block Address (DBA) 126 of the first copy of the Data Block eo (ParityGenDA) 102A, Demand Paging and Memory Man- 
(DB) 96. In the present exemplary embodiment and as agement Facility (DPMM) 106 may be directed to neither 
described above, the Data Block Address (DBA) 126 of the generate nor write parity information when writing data to 
mirror copy of the Data Block (DB) 96 is generated by Disk Drives 16. As such, and while the mass storage system 
incrementing the identification number of the Disk Drive 16 of the present invention will write Data Blocks (DBs) 96 to 
that the first copy of the Data Block (DB) 16 is stored in to 65 Disk Drives 16D in the manner described above, the gen- 
identify the next sequential Disk Drive 16 to receive the eration and writing of parity information will be inhibited, so 
mirror copy of the Data Block (DB) 16. mat no Parity Blocks (PBs) 98 are written to the Disk Drives 



